BIOTECHNOLOGY SYSTEMS BRANCH



RAW SEQUENCE LISTING 
ERROR REPORT 




The Biotechnology Systems Branch of the Scientific and Technical Information 
Center (STIC) detected errors when processing the following computer readable 
form: 

Application Serial Number: 

Source: 



Date Processed by STIC:



THE ATTACHED PRINTOUT EXPLAINS DETECTED ERRORS. 

PLEASE FORWARD THIS INFORMATION TO THE APPLICANT BY EITHER: 

1) INCLUDING A COPY OF THIS PRINTOUT IN YOUR NEXT COMMUNICATION TO THE 
APPLICANT, WITH A NOTICE TO COMPLY or, 

2) TELEPHONING APPLICANT AND FAXING A COPY OF THIS PRINTOUT, WITH A 
NOTICE TO COMPLY 

FOR CRF SUBMISSION QUESTIONS, PLEASE CONTACT MARK SPENCER, 703-308-4212. 



FOR SEQUENCE RULES INTERPRETATION, PLEASE CONTACT ROBERT WAX, 703-308-4216. 
PATENTIN 2.1 e-mail help: patin21help@uspto.gov or phone 703-306-4119 (R. Wax) 
PATENTIN 3.0 e-mail help: patin3help@uspto.gov or phone 703-306-4119 (R. Wax)

TO REDUCE ERRORED SEQUENCE LISTINGS, PLEASE USE THE CHECKER 
VERSION 3.0 PROGRAM, ACCESSIBLE THROUGH THE U.S. PATENT AND
TRADEMARK OFFICE WEBSITE. SEE BELOW: 



Checker Version 3.0 

The Checker Version 3.0 application is a state-of-the-art Windows-based software program
employing a logical and intuitive user interface to check whether a sequence listing is in
compliance with format and content rules. Checker Version 3.0 works for sequence listings
generated for the original version of 37 CFR §§1.821 - 1.825 effective October 1, 1990 (old
rules) and the revised version (new rules) effective July 1, 1998, as well as World Intellectual
Property Organization (WIPO) Standard ST.25.

Checker Version 3.0 replaces the previous DOS-based version of Checker, and is Y2K-compliant.
Checker allows public users to check sequence listings in Computer Readable Form
(CRF) before submitting them to the United States Patent and Trademark Office (USPTO).
Use of Checker prior to filing the sequence listing is expected to result in fewer errored sequence
listings, thus saving time and money.
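
As an illustration only of the kind of format and content check described above, the following minimal Python sketch counts the residues in a three-letter protein listing and compares the count with the declared LENGTH. It is not the Checker program and makes no claim about how Checker works; the function name check_protein_block and the choice to accept "Xaa" are assumptions made for this example.

```python
import re

# The 20 standard three-letter amino acid codes, plus Xaa for an unknown residue.
THREE_LETTER = {
    "Ala", "Arg", "Asn", "Asp", "Cys", "Gln", "Glu", "Gly", "His", "Ile",
    "Leu", "Lys", "Met", "Phe", "Pro", "Ser", "Thr", "Trp", "Tyr", "Val", "Xaa",
}


def check_protein_block(declared_length, lines):
    """Return a list of problems found in a three-letter protein listing.

    Hypothetical helper for illustration. Lines made up only of integers are
    treated as position-number rows and skipped; every other
    whitespace-separated token must be a recognized residue code.
    """
    problems = []
    count = 0
    for lineno, line in enumerate(lines, start=1):
        tokens = line.split()
        if tokens and all(re.fullmatch(r"-?\d+", t) for t in tokens):
            continue  # numbering row, not residues
        for tok in tokens:
            if tok in THREE_LETTER:
                count += 1
            else:
                problems.append(f"line {lineno}: unrecognized residue {tok!r}")
    if count != declared_length:
        problems.append(
            f"declared LENGTH {declared_length}, counted {count} residues"
        )
    return problems


good = ["Leu Thr Asp Glu Gln", "  1               5"]
bad = ["Lys HisVal Ala"]
print(check_protein_block(5, good))
print(check_protein_block(4, bad))
```

A run-together token such as "HisVal" or a lower-case "val" is reported as an unrecognized residue, and it also makes the counted length fall short of the declared one.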



Checker Version 3.0 can be downloaded from the USPTO website at the following address:



http://www.uspto.gov/web/offices/pac/checker



RAW SEQUENCE LISTING                    DATE: 06/27/2001
PATENT APPLICATION: US/09/876,348       TIME: 15:25:39

Input Set : A:\SEQLIST.txt
Output Set: N:\CRF3\06272001\I876348.raw

Does Not Comply
Corrected Diskette Needed



SEQUENCE LISTING

(1) GENERAL INFORMATION:
    (i) APPLICANT: Horwath, K. L., et al.
    (ii) TITLE OF INVENTION: Nucleic Acids Sequences Encoding Type III
         Tenebrio Antifreeze Proteins and Method for Assaying
         Activity.
    (iii) NUMBER OF SEQUENCES: 48
    (iv) CORRESPONDENCE ADDRESS:
        (A) ADDRESSEE: Dr. Kathleen L. Horwath
        (B) STREET: Department of Biological Sciences,
            Binghamton University
        (C) CITY: Binghamton
        (D) STATE: New York
        (F) ZIP: 13902-6000
    (v) COMPUTER READABLE FORM:
        (A) MEDIUM TYPE: Diskette, 3.50 inch,
            1.4 Mb storage
        (B) COMPUTER: IBM AT/ATX compatible
        (C) OPERATING SYSTEM: Windows 95/98
        (D) SOFTWARE: Microsoft Word
    (vi) CURRENT APPLICATION DATA:
        (A) APPLICATION NUMBER: US/09/876,348
        (B) FILING DATE: 07-Jun-2001
        (C) CLASSIFICATION:
    (vii) PRIOR APPLICATION DATA:
        (A) APPLICATION NUMBER: 60210446
        (B) FILING DATE: June 8, 2000
    (viii) ATTORNEY/AGENT INFORMATION:
        (A) NAME: Mark Levy, Attorney-at-Law
        (B) REGISTRATION NUMBER: 29,188
        (C) REFERENCE/DOCKET NUMBER: RB125
    (ix) TELECOMMUNICATION INFORMATION:
        (A) TELEPHONE: 607-722-660
        (B) TELEFAX: 607-724-2207



ERRORED SEQUENCES 

C--> 60 (2) INFORMATION FOR SEQ ID NO: 2 

61 (i) SEQUENCE CHARACTERISTICS:

62 (A) LENGTH: 566 base pairs

63 (B) TYPE: nucleic acid 

64 (C) STRANDEDNESS: double 

65 (D) TOPOLOGY: linear 

66 (ii) MOLECULE TYPE: cDNA to mRNA 

67 (iii) HYPOTHETICAL: no 

68 (iv) ANTI-SENSE: no 

69 (vi) ORIGINAL SOURCE: 

70 (A) ORGANISM: Tenebrio molitor 
C--> 71 (C) INDIVIDUAL ISOLATE: none 






C-->   72         (G) CELL TYPE: fat body and whole organism
       73     (vii) IMMEDIATE SOURCE:
       74         (A) LIBRARY: cDNA
       75         (B) CLONE: 13.17
       76     (ix) FEATURES
       78     (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2
E-->   80 GTGGATCCAA AGAATTCGGC ACGAGACTAC TAAG ATG AAG TTG CTC
       81                                       Met Lys Leu Leu
       82                                                   -15
E-->   84 TGT TGT CTA ATC TCC CTC ATT CTG TTG GTC ACA GTT CAG GCC CTG   81
       85 Cys Cys Leu Ile Ser Leu Ile Leu Leu Val Thr Val Gln Ala Leu
       86                 -10                  -5                   1
E-->   88 ACC GAG GCA CAA ATT GAG AAA CTG AAC AAG ATC AGC AAA AAA TGT  126
       89 Thr Glu Ala Gln Ile Glu Lys Leu Asn Lys Ile Ser Lys Lys Cys
       90               5                  10                  15
E-->   92 CAA AAT GAA AGT GGA GTG TCG CAA GAG ATC ATA ACC AAA GCT CGC  171
       93 Gln Asn Glu Ser Gly Val Ser Gln Glu Ile Ile Thr Lys Ala Arg
       94              20                  25                  30
E-->   96 AAC GGT GAC TGG GAG GAC GAT CCT AAA CTG AAA CGC CAA GTT TTT  216
       97 Asn Gly Asp Trp Glu Asp Asp Pro Lys Leu Lys Arg Gln Val Phe
       98              35                  40                  45
E-->  100 TGC GTG GCC AGG AAC GCC GGT CTG GCC ACG GAA TCG GGA GAG GTG  261
      101 Cys Val Ala Arg Asn Ala Gly Leu Ala Thr Glu Ser Gly Glu Val
      102              50                  55                  60
E-->  104 GTG GTC GAC GTG TTG AGG GAG AAG GTG AGG AAG GTC ACT GAC AAC  306
      105 Val Val Asp Val Leu Arg Glu Lys Val Arg Lys Val Thr Asp Asn
      106              65                  70                  75
E-->  108 GAC GAA GAA ACT GAG AAA ATC ATC AAT AAG TGC GCC GTC AAG AGA  351
      109 Asp Glu Glu Thr Glu Lys Ile Ile Asn Lys Cys Ala Val Lys Arg
      110              80                  85                  90
E-->  112 GAT ACT GTT GAA GAG ACG GTG TTC AAT ACT TTC AAA TGT GTC ATG  396
      113 Asp Thr Val Glu Glu Thr Val Phe Asn Thr Phe Lys Cys Val Met
      114              95                 100                 105
E-->  116 AAA AAC AAG CCA AAG TTC TCA CCA GTT GAT TGA ACCACCACGA        439
      117 Lys Asn Lys Pro Lys Phe Ser Pro Val Asp
      118             110                 115
E-->  120 CTAGTAGATG GTTCAAATGG TGTGCTTTAC ATATAAAAAT AAAGTGTTTC       489
E-->  122 TGATGTAAAA AAAAAAAAAA AAAAAAAAAA AACTCGAGAG TATTCTAGAG       539
E-->  124 CGGCCGCGGG CCCATCGTTT TCCACCC                                566

C-->  127 (2) INFORMATION FOR SEQ ID NO: 3
      128     (i) SEQUENCE CHARACTERISTICS:
      129         (A) LENGTH: 134 Amino Acids
      130         (B) TYPE: Amino Acid
      131         (C) STRANDEDNESS: single
      132         (D) TOPOLOGY: linear
      133     (ii) MOLECULE TYPE: Protein
      134     (iii) HYPOTHETICAL: no
      135     (iv) ANTI-SENSE: no
      136     (vi) ORIGINAL SOURCE:



      137         (A) ORGANISM: Tenebrio molitor
C-->  138         (C) INDIVIDUAL ISOLATE: none
C-->  139         (G) CELL TYPE: fat body and whole organism
      140     (vii) IMMEDIATE SOURCE:
      141         (A) LIBRARY: cDNA
      142         (B) CLONE: 13.17
      143     (ix) FEATURES
      145     (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3:
      147 Met Lys Leu Leu Cys Cys Leu Ile Ser Leu Ile Leu Leu Val Thr Val
      148             -15                 -10                  -5
      150 Gln Ala Leu Thr Glu Ala Gln Ile Glu Lys Leu Asn Lys Ile Ser Lys
E-->  151           1               5                  10
      153 Lys Cys Gln Asn Glu Ser Gly Val Ser Gln Glu Ile Ile Thr Lys Ala
      154  15                  20                  25                  30
      156 Arg Asn Gly Asp Trp Glu Asp Asp Pro Lys Leu Lys Arg Gln Val Phe
      157                  35                  40                  45
      159 Cys Val Ala Arg Asn Ala Gly Leu Ala Thr Glu Ser Gly Glu Val Val
      160              50                  55                  60
      162 Val Asp Val Leu Arg Glu Lys Val Arg Lys Val Thr Asp Asn Asp Glu
      163          65                  70                  75
      165 Glu Thr Glu Lys Ile Ile Asn Lys Cys Ala Val Lys Arg Asp Thr Val
      166      80                  85                  90
      168 Glu Glu Thr Val Phe Asn Thr Phe Lys Cys Val Met Lys Asn Lys Pro
      169  95                 100                 105                 110
E-->  171 Lys Phe Ser Pro Val Asp
E-->  172                 115

C-->  175 (2) INFORMATION FOR SEQ ID NO: 4
      176     (i) SEQUENCE CHARACTERISTICS:
      177         (A) LENGTH: 116 Amino Acids
      178         (B) TYPE: Amino Acid
      179         (C) STRANDEDNESS: single
      180         (D) TOPOLOGY: linear
      181     (ii) MOLECULE TYPE: Protein
      182     (iii) HYPOTHETICAL: no
      183     (iv) ANTI-SENSE: no
      184     (vi) ORIGINAL SOURCE:
      185         (A) ORGANISM: Tenebrio molitor
C-->  186         (C) INDIVIDUAL ISOLATE: none
C-->  187         (G) CELL TYPE: fat body and whole organism
      188     (vii) IMMEDIATE SOURCE:
      189         (A) LIBRARY: cDNA
      190         (B) CLONE: 13.17
      191     (ix) FEATURES
      193     (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4:
      195 Leu Thr Glu Ala Gln Ile Glu Lys Leu Asn Lys Ile Ser Lys Lys Cys
      196   1               5                  10                  15
      198 Gln Asn Glu Ser Gly Val Ser Gln Glu Ile Ile Thr Lys Ala Arg Asn
      199              20                  25                  30
      201 Gly Asp Trp Glu Asp Asp Pro Lys Leu Lys Arg Gln Val Phe Cys Val

      202          35                  40                  45
      204 Ala Arg Asn Ala Gly Leu Ala Thr Glu Ser Gly Glu Val Val Val Asp
      205      50                  55                  60
      207 Val Leu Arg Glu Lys Val Arg Lys Val Thr Asp Asn Asp Glu Glu Thr
      208  65                  70                  75                  80
      210 Glu Lys Ile Ile Asn Lys Cys Ala Val Lys Arg Asp Thr Val Glu Glu
      211                  85                  90                  95
      213 Thr Val Phe Asn Thr Phe Lys Cys Val Met Lys Asn Lys Pro Lys Phe
      214             100                 105                 110
E-->  216 Ser Pro Val Asp
E-->  217         115

C-->  344 (2) INFORMATION FOR SEQ ID NO: 7
      345     (i) SEQUENCE CHARACTERISTICS:
      346         (A) LENGTH: 133 Amino Acids
      347         (B) TYPE: Amino Acid
      348         (C) STRANDEDNESS: single
      349         (D) TOPOLOGY: linear
      350     (ii) MOLECULE TYPE: Protein
      351     (iii) HYPOTHETICAL: no
      352     (iv) ANTI-SENSE: no
      353     (vi) ORIGINAL SOURCE:
      354         (A) ORGANISM: Tenebrio molitor
C-->  355         (C) INDIVIDUAL ISOLATE: none
C-->  356         (G) CELL TYPE: fat body and whole organism
      357     (vii) IMMEDIATE SOURCE:
      358         (A) LIBRARY: cDNA
      359         (B) CLONE: 2.2, 2.3, and 7.5
      360     (ix) FEATURES
      362     (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7:
      364 Met Lys Leu Leu Leu Cys Phe Ala Phe Ala Ala Ile Val Ile Gly Ala
      365             -15                 -10                  -5
      367 Gln Ala Leu Thr Asp Glu Gln Ile Gln Lys Arg Asn Lys Ile Ser Lys
      368           1               5                  10
      370 Glu Cys Gln Gln Val Ser Gly Val Ser Gln Glu Thr Ile Asp Lys Val
      371  15                  20                  25                  30
      373 Arg Thr Gly Val Leu Val Asp Asp Pro Lys Met Lys Lys His Val Leu
      374                  35                  40                  45
      376 Cys Phe Ser Lys Lys Thr Gly Val Ala Thr Glu Ala Gly Asp Thr Asn
      377              50                  55                  60
      379 Val Glu Val Leu Lys Ala Lys Leu Lys His Val Ala Ser Asp Glu Glu
      380          65                  70                  75
      382 Val Asp Lys Ile Val Gln Lys Cys Val Val Lys Lys Ala Thr Pro Glu
      383      80                  85                  90
      385 Glu Thr Ala Tyr Asp Thr Phe Lys Cys Ile Tyr Asp Ser Lys Pro Asp
      386  95                                     105                 110
E-->  388 Phe Ser Pro Ile Asp
E-->  389                 115

C-->  392 (2) INFORMATION FOR SEQ ID NO: 8
      393     (i) SEQUENCE CHARACTERISTICS:

      394         (A) LENGTH: 115 Amino Acids
      395         (B) TYPE: Amino Acid
      396         (C) STRANDEDNESS: single
      397         (D) TOPOLOGY: linear
      398     (ii) MOLECULE TYPE: Protein
      399     (iii) HYPOTHETICAL: no
      400     (iv) ANTI-SENSE: no
      401     (vi) ORIGINAL SOURCE:
      402         (A) ORGANISM: Tenebrio molitor
C-->  403         (C) INDIVIDUAL ISOLATE: none
C-->  404         (G) CELL TYPE: fat body and whole organism
      405     (vii) IMMEDIATE SOURCE:
      406         (A) LIBRARY: cDNA
      407         (B) CLONE: 2.2, 2.3, and 7.5
      408     (ix) FEATURES
      410     (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8:
      412 Leu Thr Asp Glu Gln Ile Gln Lys Arg Asn Lys Ile Ser Lys Glu Cys
      413   1               5                  10                  15
      415 Gln Gln Val Ser Gly Val Ser Gln Glu Thr Ile Asp Lys Val Arg Thr
      416              20                  25                  30
      418 Gly Val Leu Val Asp Asp Pro Lys Met Lys Lys His Val Leu Cys Phe
      419          35                  40                  45
      421 Ser Lys Lys Thr Gly Val Ala Thr Glu Ala Gly Asp Thr Asn Val Glu
      422      50                  55                  60
      424 Val Leu Lys Ala Lys Leu Lys His Val Ala Ser Asp Glu Glu Val Asp
      425  65                  70                  75                  80
      427 Lys Ile Val Gln Lys Cys Val Val Lys Lys Ala Thr Pro Glu Glu Thr
      428                  85                  90                  95
      430 Ala Tyr Asp Thr Phe Lys Cys Ile Tyr Asp Ser Lys Pro Asp Phe Ser
      431             100                 105                 110
E-->  433 Pro Ile Asp
E-->  434         115

C-->  500 (2) INFORMATION FOR SEQ ID NO: 10
      501     (i) SEQUENCE CHARACTERISTICS:
      502         (A) LENGTH: 133 Amino Acids
      503         (B) TYPE: Amino Acid
      504         (C) STRANDEDNESS: single
      505         (D) TOPOLOGY: linear
      506     (ii) MOLECULE TYPE: Protein
      507     (iii) HYPOTHETICAL: no
      508     (iv) ANTI-SENSE: no
      509     (vi) ORIGINAL SOURCE:
      510         (A) ORGANISM: Tenebrio molitor
C-->  511         (C) INDIVIDUAL ISOLATE: none
C-->  512         (G) CELL TYPE: fat body and whole organism
      513     (vii) IMMEDIATE SOURCE:
      514         (A) LIBRARY: cDNA
      515         (B) CLONE: 3.4
      516     (ix) FEATURES
      518     (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10:
      520 Met Lys Leu Leu Leu Cys Phe Ala Phe Ala Ala Ile Val Ile Gly Ala
      521             -15                 -10                  -5
      523 Gln Ala Leu Thr Asp Glu Gln Ile Gln Lys Arg Asn Lys Ile Ser Lys
      524           1               5                  10
      526 Glu Cys Gln Gln Val Ser Gly Val Ser Gln Glu Thr Ile Asp Lys Val
      527  15                  20                  25                  30
      529 Arg Thr Gly Val Leu Val Asp Asp Pro Lys Met Lys Lys His Val Leu
      530                  35                  40                  45
      532 Cys Phe Ser Lys Lys Thr Gly Val Ala Thr Glu Ala Gly Asp Thr Asn
      533              50                  55                  60
      535 Val Glu Val Leu Lys Ala Lys Leu Lys His Val Ala Ser Asp Glu Glu
      536          65                  70                  75
      538 Val Asp Lys Ile Val Gln Lys Cys Val Val Lys Lys Ala Thr Pro Glu
      539      80                  85                  90
      541 Glu Thr Ala Tyr Asp Thr Phe Lys Val Ile Tyr Asp Ser Lys Pro Asp
      542  95                 100                 105                 110
E-->  544 Phe Ser Pro Ile Asp
E-->  545                 115

C-->  548 (2) INFORMATION FOR SEQ ID NO: 11
      549     (i) SEQUENCE CHARACTERISTICS:
      550         (A) LENGTH: 115 Amino Acids
      551         (B) TYPE: Amino Acid
      552         (C) STRANDEDNESS: single
      553         (D) TOPOLOGY: linear
      554     (ii) MOLECULE TYPE: Protein
      555     (iii) HYPOTHETICAL: no
      556     (iv) ANTI-SENSE: no
      557     (vi) ORIGINAL SOURCE:
      558         (A) ORGANISM: Tenebrio molitor
C-->  559         (C) INDIVIDUAL ISOLATE: none
C-->  560         (G) CELL TYPE: fat body and whole organism
      561     (vii) IMMEDIATE SOURCE:
      562         (A) LIBRARY: cDNA
      563         (B) CLONE: 3.4
      564     (ix) FEATURES
      566     (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11:
      568 Leu Thr Asp Glu Gln Ile Gln Lys Arg Asn Lys Ile Ser Lys Glu Cys
      569   1               5                  10                  15
      571 Gln Gln Val Ser Gly Val Ser Gln Glu Thr Ile Asp Lys Val Arg Thr
      572              20                  25                  30
      574 Gly Val Leu Val Asp Asp Pro Lys Met Lys Lys His Val Leu Cys Phe
      575          35                  40                  45
      577 Ser Lys Lys Thr Gly Val Ala Thr Glu Ala Gly Asp Thr Asn Val Glu
      578      50                  55                  60
      580 Val Leu Lys Ala Lys Leu Lys His Val Ala Ser Asp Glu Glu Val Asp
      581  65                  70                  75                  80
      583 Lys Ile Val Gln Lys Cys Val Val Lys Lys Ala Thr Pro Glu Glu Thr
      584                  85                  90                  95
      586 Ala Tyr Asp Thr Phe Lys Val Ile Tyr Asp Ser Lys Pro Asp Phe Ser
      587             100                 105                 110
E-->  589 Pro Ile Asp
E-->  590         115

C-->  656 (2) INFORMATION FOR SEQ ID NO: 13
      657     (i) SEQUENCE CHARACTERISTICS:
      658         (A) LENGTH: 133 Amino Acids
      659         (B) TYPE: Amino Acid
      660         (C) STRANDEDNESS: single
      661         (D) TOPOLOGY: linear
      662     (ii) MOLECULE TYPE: Protein
      663     (iii) HYPOTHETICAL: no
      664     (iv) ANTI-SENSE: no
      665     (vi) ORIGINAL SOURCE:
      666         (A) ORGANISM: Tenebrio molitor
C-->  667         (C) INDIVIDUAL ISOLATE: none
C-->  668         (G) CELL TYPE: fat body and whole organism
      669     (vii) IMMEDIATE SOURCE:
      670         (A) LIBRARY: cDNA
      671         (B) CLONE: 3.9
      672     (ix) FEATURES
      674     (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13:
      676 Met Lys Leu Leu Leu Cys Phe Ala Phe Ala Ala Ile Val Ile Gly Ala
      677             -15                 -10                  -5
      679 Gln Ala Leu Thr Asp Glu Gln Ile Gln Lys Arg Asn Lys Ile Ser Lys
      680           1               5                  10
      682 Glu Cys Gln Gln Glu Ser Gly Val Ser Gln Glu Thr Ile Asp Lys Val
      683  15                  20                  25                  30
      685 Arg Thr Gly Val Leu Val Asp Asp Pro Lys Met Lys Lys His Val Leu
      686                  35                  40                  45
      688 Cys Phe Ser Lys Arg Thr Gly Val Ala Thr Glu Ala Gly Asp Thr Asn
      689              50                  55                  60
E-->  691 Val Glu Val Leu Lys Ala Lys Leu Lys HisVal Ala Ser Asp Glu Glu
E-->  692          65                  70                  75
      694 Val Asp Lys Ile Val Gln Lys Cys Val Val Lys Lys Ala Thr Pro Glu
E-->  695      80                  85                  90
      697 Glu Thr Ala Tyr Asp Thr Phe Lys Cys Ile Tyr Asp Ser Lys Pro Asp
E-->  698  95                                     105                 110
E-->  700 Phe Ser Pro Ile Asp
E-->  701                 115

C-->  704 (2) INFORMATION FOR SEQ ID NO: 14
      705     (i) SEQUENCE CHARACTERISTICS:
      706         (A) LENGTH: 115 Amino Acids
      707         (B) TYPE: Amino Acid
      708         (C) STRANDEDNESS: single
      709         (D) TOPOLOGY: linear
      710     (ii) MOLECULE TYPE: Protein
      711     (iii) HYPOTHETICAL: no
      712     (iv) ANTI-SENSE: no
      713     (vi) ORIGINAL SOURCE:
      714         (A) ORGANISM: Tenebrio molitor
C-->  715         (C) INDIVIDUAL ISOLATE: none
C-->  716         (G) CELL TYPE: fat body and whole organism
      717     (vii) IMMEDIATE SOURCE:
      718         (A) LIBRARY: cDNA
      719         (B) CLONE: 3.9
      720     (ix) FEATURES
      722     (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14:
      724 Leu Thr Asp Glu Gln Ile Gln Lys Arg Asn Lys Ile Ser Lys Glu Cys
      725   1               5                  10                  15
      727 Gln Gln Glu Ser Gly Val Ser Gln Glu Thr Ile Asp Lys Val Arg Thr
      728              20                  25                  30
      730 Gly Val Leu Val Asp Asp Pro Lys Met Lys Lys His Val Leu Cys Phe
      731          35                  40                  45
      733 Ser Lys Arg Thr Gly Val Ala Thr Glu Ala Gly Asp Thr Asn Val Glu
      734      50                  55                  60
E-->  736 Val Leu Lys Ala Lys Leu Lys His val Ala Ser Asp Glu Glu Val Asp
E-->  737  65                  70                  75                  80
      739 Lys Ile Val Gln Lys Cys Val Val Lys Lys Ala Thr Pro Glu Glu Thr
E-->  740                  85                  90                  95
      742 Ala Tyr Asp Thr Phe Lys Cys Ile Tyr Asp Ser Lys Pro Asp Phe Ser
E-->  743             100                 105                 110
E-->  745 Pro Ile Asp
E-->  746         115
C-->  749 (2) INFORMATION FOR SEQ ID NO: 15
      750     (i) SEQUENCE CHARACTERISTICS:
      751         (A) LENGTH: 481 base pairs
      752         (B) TYPE: nucleic acid
      753         (C) STRANDEDNESS: double
      754         (D) TOPOLOGY: linear
      755     (ii) MOLECULE TYPE: cDNA to mRNA
      756     (iii) HYPOTHETICAL: no
      757     (iv) ANTI-SENSE: no
      758     (vi) ORIGINAL SOURCE:
      759         (A) ORGANISM: Tenebrio molitor
C-->  760         (C) INDIVIDUAL ISOLATE: none
C-->  761         (G) CELL TYPE: fat body and whole organism
      762     (vii) IMMEDIATE SOURCE:
      763         (A) LIBRARY: cDNA
      764         (B) CLONE: 7.5
      765     (ix) FEATURES
      767     (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15:
      769 GGCACGAGCA AAA ATG AAA CTC CTC TTG TGC TTT GCG TTC GCC GCC   46
      770                Met Lys Leu Leu Leu Cys Phe Ala Phe Ala Ala
      771                            -15                 -10
E-->  773 ATC GTC ATC GGA GCT CAG GCT CTC ACC GAC GAA CAG ATA CAG AAA
      774 Ile Val Ile Gly Ala Gln Ala Leu Thr Asp Glu Gln Ile Gln Lys
      775          -5                   1               5
      777 AGG AAC AAG ATC AGC AAA GAG TGC CAG CAG GTG TCC GGA GTG TCC  136
      778 Arg Asn Lys Ile Ser Lys Glu Cys Gln Gln Val Ser Gly Val Ser
      779      10                  15                  20
      781 CAA GAG ACG ATC GAC AAA GTC CGC ACA GGT GTC TTG GTC GAC GAT  181
      782 Gln Glu Thr Ile Asp Lys Val Arg Thr Gly Val Leu Val Asp Asp
      783      25                  30                  35
      785 CCC AAA ATG AAG AAG CAC GTC CTC TGC TTC TCG AAG AAA ACT GGA  226
      786 Pro Lys Met Lys Lys His Val Leu Cys Phe Ser Lys Lys Thr Gly
      787      40                  45                  50
      789 GTG GCA ACC GAA GCC GGA GAC ACC AAT GTG GAG GTA CTC AAA GCC  271
      790 Val Ala Thr Glu Ala Gly Asp Thr Asn Val Glu Val Leu Lys Ala
      791      55                  60                  65
      793 AAG CTG AAG CAT GTG GCC AGC GAC GAA GAG GTG GAC AAG ATC GTG  316
      794 Lys Leu Lys His Val Ala Ser Asp Glu Glu Val Asp Lys Ile Val
      795      70                  75                  80
      797 CAG AAG TGC GTG GTC AAG AAG GCC ACA CCA GAG GAA ACG GCT TAT  361
      798 Gln Lys Cys Val Val Lys Lys Ala Thr Pro Glu Glu Thr Ala Tyr
      799      85                  90                  95
      801 GAC ACC TTC AAG TGT ATT TAC GAC AGT AAA CCT GAT TTC TCT CCT  406
      802 Asp Thr Phe Lys Cys Ile Tyr Asp Ser Lys Pro Asp Phe Ser Pro
      803     100                 105                 110
      805 ATT GAT TAA TTGTTTTGTA TTTGGCTGAA TTTTGACAAT AAAGGTACTA      455
      806 Ile Asp  *
      807     115
      809 TCGTTATGTA AAAAAAAAAA AAAAAA                                 481
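
The number printed at the right of each nucleotide row in SEQ ID NO: 15 is a running total of bases. The short Python sketch below shows one way such totals can be recomputed from the rows themselves; it is offered as an illustration under stated assumptions and is not the Checker program. The function name running_totals is hypothetical, and the three rows are copied from print lines 769, 773 and 777 of the listing above.

```python
def running_totals(rows):
    """Return the cumulative base count after each nucleotide row.

    Hypothetical helper for illustration: every non-space character in a
    row is counted as one base, so codon triplets and ten-base blocks are
    handled the same way.
    """
    total = 0
    totals = []
    for row in rows:
        total += sum(len(token) for token in row.split())
        totals.append(total)
    return totals


rows = [
    "GGCACGAGCA AAA ATG AAA CTC CTC TTG TGC TTT GCG TTC GCC GCC",
    "ATC GTC ATC GGA GCT CAG GCT CTC ACC GAC GAA CAG ATA CAG AAA",
    "AGG AAC AAG ATC AGC AAA GAG TGC CAG CAG GTG TCC GGA GTG TCC",
]
print(running_totals(rows))  # [46, 91, 136]
```

The first and third totals agree with the 46 and 136 printed on lines 769 and 777. Comparing recomputed totals with the printed ones, and the final total with the declared LENGTH, is a quick consistency test before a corrected diskette is submitted.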

C-->  888 (2) INFORMATION FOR SEQ ID NO: 17
      889     (i) SEQUENCE CHARACTERISTICS:
      890         (A) LENGTH: 169 Amino Acids
      891         (B) TYPE: Amino Acid
      892         (C) STRANDEDNESS: single
      893         (D) TOPOLOGY: linear
      894     (ii) MOLECULE TYPE: Protein
      895     (iii) HYPOTHETICAL: no
      896     (iv) ANTI-SENSE: no
      897     (vi) ORIGINAL SOURCE:
      898         (A) ORGANISM: Tenebrio molitor
C-->  899         (C) INDIVIDUAL ISOLATE: none
C-->  900         (G) CELL TYPE: fat body and whole organism
      901     (vii) IMMEDIATE SOURCE:
      902         (A) LIBRARY: cDNA
      903         (B) CLONE: 2.2
      904     (ix) FEATURES

      906     (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17:
      908 Met Gly Ser Ser His His His His His His Ser Ser Gly Leu Val Pro
      909             -55                 -50                 -45
      911 Arg Gly Ser His Met Ala Ser Met Thr Gly Gly Gln Gln Met Gly Arg
      912         -40                 -35                 -30
      914 Gly Ser Glu Phe Ala Arg Ala Lys Met Lys Leu Leu Leu Cys Phe Ala
      915     -25                 -20                 -15
      917 Phe Ala Ala Ile Val Ile Gly Ala Gln Ala Leu Thr Asp Glu Gln Ile
      918 -10                  -5                   1               5
      920 Gln Lys Arg Asn Lys Ile Ser Lys Glu Cys Gln Gln Val Ser Gly Val
      921              10                  15                  20
      923 Ser Gln Glu Thr Ile Asp Lys Val Arg Thr Gly Val Leu Val Asp Asp
      924          25                  30                  35
      926 Pro Lys Met Lys Lys His Val Leu Cys Phe Ser Lys Lys Thr Gly Val
      927      40                  45                  50
      929 Ala Thr Glu Ala Gly Asp Thr Asn Val Glu Val Leu Lys Ala Lys Leu
      930  55                  60                  65                  70
      932 Lys His Val Ala Ser Asp Glu Glu Val Asp Lys Ile Val Gln Lys Cys
      933                  75                  80                  85
      935 Val Val Lys Lys Ala Thr Pro Glu Glu Thr Ala Tyr Asp Thr Phe Lys
      936              90                  95
E-->  938 Cys Ile Tyr Asp Ser Lys Pro Asp Phe Ser Pro Ile Asp
E-->  939         105                 110                 115

C--> 1009 (2) INFORMATION FOR SEQ ID NO: 19
     1010     (i) SEQUENCE CHARACTERISTICS:
     1011         (A) LENGTH: 149 Amino Acids
     1012         (B) TYPE: Amino Acid
     1013         (C) STRANDEDNESS: single
     1014         (D) TOPOLOGY: linear
     1015     (ii) MOLECULE TYPE: Protein
     1016     (iii) HYPOTHETICAL: no
     1017     (iv) ANTI-SENSE: no
     1018     (vi) ORIGINAL SOURCE:
     1019         (A) ORGANISM: Tenebrio molitor
C--> 1020         (C) INDIVIDUAL ISOLATE: none
C--> 1021         (G) CELL TYPE: fat body and whole organism
     1022     (vii) IMMEDIATE SOURCE:
     1023         (A) LIBRARY: cDNA
     1024         (B) CLONE: 2.2
     1025     (ix) FEATURES
     1027     (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19:
     1029 Met Gly Ser Ser His His His His His His Ser Ser Gly Leu Val Pro
     1030                 -30                 -25                 -20
     1032 Arg Gly Ser His Met Ala Ser Met Thr Gly Gly Gln Gln Met Gly Arg
     1033             -15                 -10                  -5
     1035 Gly Ser Leu Thr Asp Glu Gln Ile Gln Lys Arg Asn Lys Ile Ser Lys
     1036           1               5                  10
     1038 Glu Cys Gln Gln Val Ser Gly Val Ser Gln Glu Thr Ile Asp Lys Val
     1039  15                  20                  25                  30
     1041 Arg Thr Gly Val Leu Val Asp Asp Pro Lys Met Lys Lys His Val Leu
     1042                  35                  40                  45
     1044 Cys Phe Ser Lys Lys Thr Gly Val Ala Thr Glu Ala Gly Asp Thr Asn
     1045              50                  55                  60
     1047 Val Glu Val Leu Lys Ala Lys Leu Lys His Val Ala Ser Asp Glu Glu
     1048          65                  70                  75
     1050 Val Asp Lys Ile Val Gln Lys Cys Val Val Lys Lys Ala Thr Pro Glu
     1051      80                  85                  90
     1053 Glu Thr Ala Tyr Asp Thr Phe Lys Cys Ile Tyr Asp Ser Lys Pro Asp
     1054  95                 100                 105                 110
E--> 1056 Phe Ser Pro Ile Asp
E--> 1057                 115

C--> 1136 (2) INFORMATION FOR SEQ ID NO: 21
     1137     (i) SEQUENCE CHARACTERISTICS:
     1138         (A) LENGTH: 169 Amino Acids
     1139         (B) TYPE: Amino Acid
     1140         (C) STRANDEDNESS: single
     1141         (D) TOPOLOGY: linear
     1142     (ii) MOLECULE TYPE: Protein
     1143     (iii) HYPOTHETICAL: no
     1144     (iv) ANTI-SENSE: no
     1145     (vi) ORIGINAL SOURCE:
     1146         (A) ORGANISM: Tenebrio molitor
C--> 1147         (C) INDIVIDUAL ISOLATE: none
C--> 1148         (G) CELL TYPE: fat body and whole organism
     1149     (vii) IMMEDIATE SOURCE:
     1150         (A) LIBRARY: cDNA
     1151         (B) CLONE: 2.3
     1152     (ix) FEATURES
     1154     (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21:
     1156 Met Gly Ser Ser His His His His His His Ser Ser Gly Leu Val Pro
     1157             -55                 -50                 -45
     1159 Arg Gly Ser His Met Ala Ser Met Thr Gly Gly Gln Gln Met Gly Arg
     1160         -40                 -35                 -30
     1162 Gly Ser Glu Phe Ala Arg Ala Lys Met Lys Leu Leu Leu Cys Phe Ala
     1163     -25                 -20                 -15
     1165 Phe Ala Ala Ile Val Ile Gly Ala Gln Ala Leu Thr Asp Glu Gln Ile
     1166 -10                  -5                   1               5
     1168 Gln Lys Arg Asn Lys Ile Ser Lys Glu Cys Gln Gln Val Ser Gly Val
     1169              10                  15                  20
     1171 Ser Gln Glu Thr Ile Asp Lys Val Arg Thr Gly Val Leu Val Asp Asp
     1172          25                  30                  35
     1174 Pro Lys Met Lys Lys His Val Leu Cys Phe Ser Lys Lys Thr Gly Val
     1175      40                  45                  50
     1177 Ala Thr Glu Ala Gly Asp Thr Asn Val Glu Val Leu Lys Ala Lys Leu
     1178  55                  60                  65                  70
     1180 Lys His Val Ala Ser Asp Glu Glu Val Asp Lys Ile Val Gln Lys Cys
     1181                  75                  80                  85
     1183 Val Val Lys Lys Ala Thr Pro Glu Glu Thr Ala Tyr Asp Thr Phe Lys
     1184              90                  95
E--> 1186 Cys Ile Tyr Asp Ser Lys Pro Asp Phe Ser Pro Ile Asp
E--> 1187         105                 110                 115

C--> 1257 (2) INFORMATION FOR SEQ ID NO: 23
     1258     (i) SEQUENCE CHARACTERISTICS:
     1259         (A) LENGTH: 149 Amino Acids
     1260         (B) TYPE: Amino Acid
     1261         (C) STRANDEDNESS: single
     1262         (D) TOPOLOGY: linear
     1263     (ii) MOLECULE TYPE: Protein
     1264     (iii) HYPOTHETICAL: no
     1265     (iv) ANTI-SENSE: no
     1266     (vi) ORIGINAL SOURCE:
     1267         (A) ORGANISM: Tenebrio molitor
C--> 1268         (C) INDIVIDUAL ISOLATE: none
C--> 1269         (G) CELL TYPE: fat body and whole organism
     1270     (vii) IMMEDIATE SOURCE:
     1271         (A) LIBRARY: cDNA
     1272         (B) CLONE: 2.3
     1273     (ix) FEATURES
     1275     (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23:
     1277 Met Gly Ser Ser His His His His His His Ser Ser Gly Leu Val Pro
     1278                 -30                 -25                 -20
     1280 Arg Gly Ser His Met Ala Ser Met Thr Gly Gly Gln Gln Met Gly Arg
     1281             -15                 -10                  -5
     1283 Gly Ser Leu Thr Asp Glu Gln Ile Gln Lys Arg Asn Lys Ile Ser Lys
     1284           1               5                  10
     1286 Glu Cys Gln Gln Val Ser Gly Val Ser Gln Glu Thr Ile Asp Lys Val
     1287  15                  20                  25                  30
     1289 Arg Thr Gly Val Leu Val Asp Asp Pro Lys Met Lys Lys His Val Leu
     1290                  35                  40                  45
     1292 Cys Phe Ser Lys Lys Thr Gly Val Ala Thr Glu Ala Gly Asp Thr Asn
     1293              50                  55                  60
     1295 Val Glu Val Leu Lys Ala Lys Leu Lys His Val Ala Ser Asp Glu Glu
     1296          65                  70                  75
     1298 Val Asp Lys Ile Val Gln Lys Cys Val Val Lys Lys Ala Thr Pro Glu
     1299      80                  85                  90
     1301 Glu Thr Ala Tyr Asp Thr Phe Lys Cys Ile Tyr Asp Ser Lys Pro Asp
     1302  95                                     105                 110
E--> 1304 Phe Ser Pro Ile Asp
E--> 1305                 115
C--> 1308 (2) INFORMATION FOR SEQ ID NO: 24

1309 (i) SEQUENCE CHARACTERISTICS:

1310 (A) LENGTH: (ill j jase pairs 

1311 (B) TYPE: nucleic acid 

1312 (C) STRANDEDNESS: double 

1313 (D) TOPOLOGY: linear 

1314 (ii) MOLECULE TYPE: cDNA to mRNA

1315 (iii) HYPOTHETICAL: no 

1316 (iv) ANTI-SENSE: no 

1317 (vi) ORIGINAL SOURCE: 

1318 (A) ORGANISM: Tenebrio molitor 
C--> 1319 (C) INDIVIDUAL ISOLATE: none 

C--> 1320 (G) CELL TYPE: fat body and whole organism 

1321 (vii) IMMEDIATE SOURCE: 

1322 (A) LIBRARY: cDNA

1323 (B) CLONE: 13.17

1324 (ix) FEATURES

1327 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24:

1329 TTGTTAGCGG ATGGAATTCC CTCGTAGGGG ATAATTTTGT TTACTTTAAG        50
1331 AAGGAGATAT ACC ATG GGC AGC AGC CAT CAT CAT CAT CAT CAC AGC   96
1332                Met Gly Ser Ser His His His His His His Ser
1333                -65                 -60                 -55
1335 AGC GGC CTG GTG CCG CGC GGC AGC CAT ATG GCT AGC ATG ACT GGT  141
1336 Ser Gly Leu Val Pro Arg Gly Ser His Met Ala Ser Met Thr Gly
1337                 -50                 -45                 -40
1339 GGA CAG CAA ATG GGT CGC GGA TCC GAA TTC TGG ATC CAA AGA ATT  186
1340 Gly Gln Gln Met Gly Arg Gly Ser Glu Phe Trp Ile Gln Arg Ile
1341                 -35                 -30                 -25
1343 CGG CAC GAG ACT ACT AAG ATG AAG TTG CTC TGT TGT CTA ATC TCC  231
1344 Arg His Glu Thr Thr Lys Met Lys Leu Leu Cys Cys Leu Ile Ser
1345                 -20                 -15                 -10
1348 CTC ATT CTG TTG GTC ACA GTT CAG GCC CTG ACC GAG GCA CAA ATT  276
1349 Leu Ile Leu Leu Val Thr Val Gln Ala Leu Thr Glu Ala Gln Ile
1350                  -5                   1               5
1352 GAG AAA CTG AAC AAG ATC AGC AAA AAA TGT CAA AAT GAA AGT GGA  321
1353 Glu Lys Leu Asn Lys Ile Ser Lys Lys Cys Gln Asn Glu Ser Gly
1354              10                  15                  20
1356 GTG TCG CAA GAG ATC ATA ACC AAA GCT CGC AAC GGT GAC TGG GAG  366
1357 Val Ser Gln Glu Ile Ile Thr Lys Ala Arg Asn Gly Asp Trp Glu
1358              25                  30                  35
1360 GAC GAT CCT AAA CTG AAA CGC CAA GTT TTT TGC GTG GCC AGG AAC  411
1361 Asp Asp Pro Lys Leu Lys Arg Gln Val Phe Cys Val Ala Arg Asn
1362              40                  45                  50
1364 GCC GGT CTG GCC ACG GAA TCG GGA GAG GTG GTG GTC GAC GTG TTG  456
1365 Ala Gly Leu Ala Thr Glu Ser Gly Glu Val Val Val Asp Val Leu
1366              55                  60                  65
1368 AGG GAG AAG GTG AGG AAG GTC ACT GAC AAC GAC GAA GAA ACT GAG  501
1369 Arg Glu Lys Val Arg Lys Val Thr Asp Asn Asp Glu Glu Thr Glu
1370              70                  75                  80
1372 AAA ATC ATC AAT AAG TGC GCC GTC AAG AGA GAT ACT GTT GAA GAG  546
1373 Lys Ile Ile


Asn Lys 


Cys 


Ala 


Val 


Lys Arg 


Asp 


Thr 


Val 


Glu 


Glu 






1374 








85 








90 








95 






E--> 


1376 


ACG 


GTG 


TTC 


AAT ACT 


TTC 


AAA 


TGT 


GTC ATG 


AAA 


AAC 


AAG 


CCA 


AAG 


(595 




1377 


Thr 


Val 


Phe 


Asn Thr 


Phe 


Lys 


Cys 


Val Met 


Lys 


Asn 


Lys 


Pro 


Lys 






1378 








100 








105 








110 






E--> 


1380 


TTC 


TCA 


CCA 


GTT GAT 


TGA 


ACCACCACGA C TAG TAG ATG GTTCAAATGG 


643 




1381 


Phe 


Ser 


Pro 


Val Asp 


* 






















1382 








115 






















E--> 


1384 


TGTGCTTTAC ATATAAAAAT AAAGTGTTTC TGATGTAAAA 


AAAAAAAAAA 




693 


E--> 


1386 


AAAAAAAAAA AACTCGAGAG TATTCTAGAG CGGCCGCGGG 


CCCATCGTTT 




743 


E--> 


1388 


TCCACCCCTC GAGCACCACC ACCACCACCA CTGAGAT 










777 
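
The coding region of SEQ ID NO: 24 is listed with its translation beneath each codon row. As a rough illustration only (it is not part of the STIC report or of the Checker program), the Python sketch below translates the codons of listing line 1331 with the standard genetic code and compares the result with the residues on line 1332. The function name `translate` and both lookup tables are assumptions made for this example.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order
# at each of the three positions (TTT, TTC, TTA, TTG, TCT, ...).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# One-letter to three-letter residue codes as used in the listing.
THREE_LETTER = {
    "A": "Ala", "R": "Arg", "N": "Asn", "D": "Asp", "C": "Cys",
    "Q": "Gln", "E": "Glu", "G": "Gly", "H": "His", "I": "Ile",
    "L": "Leu", "K": "Lys", "M": "Met", "F": "Phe", "P": "Pro",
    "S": "Ser", "T": "Thr", "W": "Trp", "Y": "Tyr", "V": "Val",
    "*": "*",  # stop codon, shown as an asterisk in the listing
}


def translate(codons: str) -> list[str]:
    """Translate whitespace-separated DNA codons into three-letter residues."""
    return [THREE_LETTER[CODON_TABLE[c.upper()]] for c in codons.split()]


# Codons and residues copied from listing lines 1331 and 1332.
codons_1331 = "ATG GGC AGC AGC CAT CAT CAT CAT CAT CAC AGC"
residues_1332 = "Met Gly Ser Ser His His His His His His Ser".split()

print(" ".join(translate(codons_1331)))
print("matches line 1332:", translate(codons_1331) == residues_1332)
```

The same call can be applied to any other codon row of the listing, for example the last coding row (line 1380), whose TGA codon translates to the asterisk shown on line 1381.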


C--> 1391 (2) INFORMATION FOR SEQ ID NO: 25

1392 (i) SEQUENCE CHARACTERISTICS:

1393 (A) LENGTH: 170 Amino Acids

1394 (B) TYPE: Amino Acid

1395 (C) STRANDEDNESS : single 

1396 (D) TOPOLOGY: linear 

1397 (ii) MOLECULE TYPE: Protein 

1398 (iii) HYPOTHETICAL: no 

1399 (iv) ANTI-SENSE: no 

1400 (vi) ORIGINAL SOURCE: 

1401 (A) ORGANISM: Tenebrio molitor 
C--> 1402 (C) INDIVIDUAL ISOLATE: none

C--> 1403 (G) CELL TYPE: fat body and whole organism

1404 (vii) IMMEDIATE SOURCE: 

1405 (A) LIBRARY: cDNA 

1406 (B) CLONE: 13.17 

1407 (ix) FEATURES

1409 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 
1411 Met Gly Ser Ser His His His His His His Ser Ser Gly Leu Val Pro

1412 -55 -50 -45

1414 Arg Gly Ser His Met Ala Ser Met Thr Gly Gly Gln Gln Met Gly Arg

1415 -40 -35 -30

1417 Gly Ser Glu Phe Ala Arg Ala Lys Met Lys Leu Leu Cys Cys Leu Ile

1418 -25 -20 -15

1420 Ser Leu Ile Leu Leu Val Thr Val Gln Ala Leu Thr Glu Ala Gln Ile

E--> 1421 -10 -5 5

1423 Glu Lys Leu Asn Lys Ile Ser Lys Lys Cys Gln Asn Glu Ser Gly Val

1424 10 15 20

1426 Ser Gln Glu Ile Ile Thr Lys Ala Arg Asn Gly Asp Trp Glu Asp Asp

1427 25 30 35

1429 Pro Lys Leu Lys Arg Gln Val Phe Cys Val Ala Arg Asn Ala Gly Leu

1430 40 45 50

1432 Ala Thr Glu Ser Gly Glu Val Val Val Asp Val Leu Arg Glu Lys Val

1433 55 60 65 70

1435 Arg Lys Val Thr Asp Asn Asp Glu Glu Thr Glu Lys Ile Ile Asn Lys

1436 75 80 85

1438 Cys Ala Val Lys Arg Asp Thr Val Glu Glu Thr Val Phe Asn Thr Phe

1439 90 95 100

E--> 1441 Lys Cys Val Met Lys Asn Lys Pro Lys Phe Ser Pro Val Asp

E--> 1442 105 110 115

C--> 1512 (2) INFORMATION FOR SEQ ID NO: 27

1513 (i) SEQUENCE CHARACTERISTICS: 

1514 (A) LENGTH: 149 Amino Acids 

1515 (B) TYPE: Amino Acid 

1516 (C) STRANDEDNESS : single 

1517 (D) TOPOLOGY: linear 

1518 (ii) MOLECULE TYPE: Protein 

1519 (iii) HYPOTHETICAL: no 

1520 (iv) ANTI-SENSE: no 

1521 (vi) ORIGINAL SOURCE: 

1522 (A) ORGANISM: Tenebrio molitor 
C--> 1523 (C) INDIVIDUAL ISOLATE: none



file://C:\CRF3\Outhold\VsrI876348.htm 6/27/01



Page 15 of 28 



RAW SEQUENCE LISTING DATE: 06/27/2001 

PATENT APPLICATION: US/09/876,348 TIME: 15:25:39

Input Set : A:\SEQLIST.txt 

Output Set: N:\CRF3\06272001\I876348.raw 

C--> 1524 (G) CELL TYPE: fat body and whole organism 

1525 (vii) IMMEDIATE SOURCE: 

1526 (A) LIBRARY: cDNA

1527 (B) CLONE: 13.17 

1530 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 



1532 Met Gly Ser Ser His His His His His His Ser Ser Gly Leu Val Pro

1533 -30 -25 -20

1535 Arg Gly Ser His Met Ala Ser Met Thr Gly Gly Gln Gln Met Gly Arg

1536 -15 -10 -5

1538 Gly Leu Thr Glu Ala Gln Ile Glu Lys Leu Asn Lys Ile Ser Lys Lys

1539 1 5 10 15

1541 Cys Gln Asn Glu Ser Gly Val Ser Gln Glu Ile Ile Thr Lys Ala Arg

1542 20 25 30

1544 Asn Gly Asp Trp Glu Asp Asp Pro Lys Leu Lys Arg Gln Val Phe Cys

1545 35 40 45

1547 Val Ala Arg Asn Ala Gly Leu Ala Thr Glu Ser Gly Glu Val Val Val

1548 50 55 60

1550 Asp Val Leu Arg Glu Lys Val Arg Lys Val Thr Asp Asn Asp Glu Glu

1551 65 70 75

1553 Thr Glu Lys Ile Ile Asn Lys Cys Ala Val Lys Arg Asp Thr Val Glu

1554 80 85 90 95

1556 Glu Thr Val Phe Asn Thr Phe Lys Cys Val Met Lys Asn Lys Pro Lys

1557 100 105 110

1559 Phe Ser Pro Val Asp

1560 115

1639 (2) INFORMATION FOR SEQ ID NO: 29

1640 (i) SEQUENCE CHARACTERISTICS: 

1641 (A) LENGTH: 173 Amino Acids 

1642 (B) TYPE: Amino Acid 

1643 (C) STRANDEDNESS : single 

1644 (D) TOPOLOGY: linear
1645 (ii) MOLECULE TYPE: Protein

1646 (iii) HYPOTHETICAL: no

1647 (iv) ANTI-SENSE: no
1648 (vi) ORIGINAL SOURCE:
1649 (A) ORGANISM: Tenebrio molitor 

C--> 1650 (C) INDIVIDUAL ISOLATE: none 

C--> 1651 (G) CELL TYPE: fat body and whole organism 

1652 (vii) IMMEDIATE SOURCE:

1653 (A) LIBRARY: cDNA 

1654 (B) CLONE: 3.4 
C--> 1655 (ix) FEATURE:

1656 (D) OTHER INFORMATION: Precursor protein with His-tag

1657 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29:

1659 Met Gly Ser Ser His His His His His His Ser Ser Gly Leu Val Pro

1660 -55 -50 -45

1662 Arg Gly Ser His Met Ala Ser Met Thr Gly Gly Gln Gln Met Gly Arg

1663 -40 -35 -30 
1665 Gly Ser Glu Phe Ala Arg Ala Lys Met Lys Leu Leu Leu Cys Phe Ala

1666 -25 -20 -15

1668 Phe Ala Ala Ile Val Ile Gly Ala Gln Ala Leu Thr Asp Glu Gln Ile

1669 -10 -5 1 5

1671 Gln Lys Arg Asn Lys Ile Ser Lys Glu Cys Gln Gln Val Ser Gly Val

1672 10 15 20

1674 Ser Gln Glu Thr Ile Asp Lys Val Arg Thr Gly Val Leu Val Asp Asp

1675 25 30 35

1677 Pro Lys Met Lys Lys His Val Leu Cys Phe Ser Lys Lys Thr Gly Val

1678 40 45 50

1680 Ala Thr Glu Ala Gly Asp Thr Asn Val Glu Val Leu Lys Ala Lys Leu

1681 55 60 65 70

1683 Lys His Val Ala Ser Asp Glu Glu Val Asp Lys Ile Val Gln Lys Cys

1684 75 80 85

1686 Val Val Lys Lys Ala Thr Pro Glu Glu Thr Ala Tyr Asp Thr Phe Lys

1687 90 95

1689 Val Ile Tyr Asp Ser Lys Pro Asp Phe Ser Pro Ile Asp

1690 105 110

C--> 1761 (2) INFORMATION FOR SEQ ID NO: 31 

1762 (i) SEQUENCE CHARACTERISTICS: 

1763 (A) LENGTH: 149 Amino Acids 

1764 (B) TYPE: Amino Acid 

1765 (C) STRANDEDNESS : single 

1766 (D) TOPOLOGY: linear 

1767 (ii) MOLECULE TYPE: Protein 

1768 (iii) HYPOTHETICAL: no 

1769 (iv) ANTI-SENSE: no 

1770 (vi) ORIGINAL SOURCE: 

1771 (A) ORGANISM: Tenebrio molitor 
C--> 1772 (C) INDIVIDUAL ISOLATE: none 
C--> 1773 (G) CELL TYPE: fat body and whole organism

1774 (vii) IMMEDIATE SOURCE: 

1775 (A) LIBRARY: cDNA 

1776 (B) CLONE: 3.4 
C--> 1777 (ix) FEATURE: 

1778 (D) OTHER INFORMATION: Mature Protein with His-tag 

1779 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 

1781 Met Gly Ser Ser His His His His His His Ser Ser Gly Leu Val Pro 

1782 -30 -25 -20 

1784 Arg Gly Ser His Met Ala Ser Met Thr Gly Gly Gln Gln Met Gly Arg

1785 -15 -10 -5

1787 Gly Ser Leu Thr Asp Glu Gln Ile Gln Lys Arg Asn Lys Ile Ser Lys

1788 1 5 10

1790 Glu Cys Gln Gln Val Ser Gly Val Ser Gln Glu Thr Ile Asp Lys Val

1791 15 20 25 30

1793 Arg Thr Gly Val Leu Val Asp Asp Pro Lys Met Lys Lys His Val Leu

1794 35 40 45

1796 Cys Phe Ser Lys Lys Thr Gly Val Ala Thr Glu Ala Gly Asp Thr Asn

1797 50 55 60

1799 Val Glu Val Leu Lys Ala Lys Leu Lys His Val Ala Ser Asp Glu Glu

1800 65 70 75

1802 Val Asp Lys Ile Val Gln Lys Cys Val Val Lys Lys Ala Thr Pro Glu

1803 80 85 90

1805 Glu Thr Ala Tyr Asp Thr Phe Lys Val Ile Tyr Asp Ser Lys Pro Asp

1806 95 100 105 110
E--> 1808 Phe Ser Pro Ile Asp *

E--> 1809 115
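
The report flags listings in which the declared LENGTH does not agree with the number of residues actually found. A minimal sketch of that kind of check, applied to the residue rows of SEQ ID NO: 31 (declared LENGTH: 149 Amino Acids), might look like the following. It is an illustration only and not the Checker program's implementation; `count_residues` and `VALID_CODES` are names invented here, the three-letter codes are written in their standard spelling (Gln, Ile), and the trailing asterisk on line 1808 is left out of the input.

```python
# Three-letter codes for the 20 standard amino acids.
VALID_CODES = {
    "Ala", "Arg", "Asn", "Asp", "Cys", "Gln", "Glu", "Gly", "His", "Ile",
    "Leu", "Lys", "Met", "Phe", "Pro", "Ser", "Thr", "Trp", "Tyr", "Val",
}


def count_residues(rows):
    """Return (count, invalid_tokens) for a list of listing rows.

    Tokens that are plain integers (position labels such as -30 or 115)
    are skipped; anything else that is not a valid code is reported.
    """
    count, invalid = 0, []
    for row in rows:
        for token in row.split():
            if token.lstrip("-").isdigit():
                continue  # position label, not a residue
            if token in VALID_CODES:
                count += 1
            else:
                invalid.append(token)
    return count, invalid


# Residue rows of SEQ ID NO: 31 (listing lines 1781 to 1808).
SEQ_31 = [
    "Met Gly Ser Ser His His His His His His Ser Ser Gly Leu Val Pro",
    "Arg Gly Ser His Met Ala Ser Met Thr Gly Gly Gln Gln Met Gly Arg",
    "Gly Ser Leu Thr Asp Glu Gln Ile Gln Lys Arg Asn Lys Ile Ser Lys",
    "Glu Cys Gln Gln Val Ser Gly Val Ser Gln Glu Thr Ile Asp Lys Val",
    "Arg Thr Gly Val Leu Val Asp Asp Pro Lys Met Lys Lys His Val Leu",
    "Cys Phe Ser Lys Lys Thr Gly Val Ala Thr Glu Ala Gly Asp Thr Asn",
    "Val Glu Val Leu Lys Ala Lys Leu Lys His Val Ala Ser Asp Glu Glu",
    "Val Asp Lys Ile Val Gln Lys Cys Val Val Lys Lys Ala Thr Pro Glu",
    "Glu Thr Ala Tyr Asp Thr Phe Lys Val Ile Tyr Asp Ser Lys Pro Asp",
    "Phe Ser Pro Ile Asp",
]
DECLARED_LENGTH = 149

found, bad = count_residues(SEQ_31)
print(f"declared {DECLARED_LENGTH}, found {found}, invalid tokens {bad}")
```

A run-together token such as `HisVal` is not a valid code, so a check of this shape would report it rather than count it.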

C--> 1888 (2) INFORMATION FOR SEQ ID NO: 33 

1889 (i) SEQUENCE CHARACTERISTICS: 

1890 (A) LENGTH: 173 Amino Acids

1891 (B) TYPE: Amino Acid

1892 (C) STRANDEDNESS : single 

1893 (D) TOPOLOGY: linear 

1894 (ii) MOLECULE TYPE: Protein 

1895 (iii) HYPOTHETICAL: no 

1896 (iv) ANTI-SENSE: no 

1897 (vi) ORIGINAL SOURCE:

1898 (A) ORGANISM: Tenebrio molitor 
C--> 1899 (C) INDIVIDUAL ISOLATE: none 

C--> 1900 (G) CELL TYPE: fat body and whole organism 

1901 (vii) IMMEDIATE SOURCE: 

1902 (A) LIBRARY: cDNA

1903 (B) CLONE: 3.9
C--> 1904 (ix) FEATURE:

1905 (D) OTHER INFORMATION: Precursor Protein with His-tag 

1906 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 

1908 Met Gly Ser Ser His His His His His His Ser Ser Gly Leu Val Pro 

1909 -55 -50 -45 

1911 Arg Gly Ser His Met Ala Ser Met Thr Gly Gly Gln Gln Met Gly Arg

1912 -40 -35 -30

1914 Gly Ser Glu Phe Ala Arg Ala Lys Met Lys Leu Leu Leu Cys Phe Ala

1915 -25 -20 -15

1917 Phe Ala Ala Ile Val Ile Gly Ala Gln Ala Leu Thr Asp Glu Gln Ile

1918 -10 -5 1 5

1920 Gln Lys Arg Asn Lys Ile Ser Lys Glu Cys Gln Gln Glu Ser Gly Val

1921 10 15 20

1923 Ser Gln Glu Thr Ile Asp Lys Val Arg Thr Gly Val Leu Val Asp Asp

1924 25 30 35

1926 Pro Lys Met Lys Lys His Val Leu Cys Phe Ser Lys Arg Thr Gly Val

1927 40 45 50

1929 Ala Thr Glu Ala Gly Asp Thr Asn Val Glu Val Leu Lys Ala Lys Leu

1930 55 60 65 70
E--> 1932 Lys HisVal Ala Ser Asp Glu Glu Val Asp Lys Ile Val Gln Lys Cys
E--> 1933 75 80 85

1935 Val Val Lys Lys Ala Thr Pro Glu Glu Thr Ala Tyr Asp Thr Phe Lys

E--> 1936 90 95

E--> 1938 Cys Ile Tyr Asp Ser Lys Pro Asp Phe Ser Pro Ile Asp

E--> 1939 105 110 115



C--> 2009 (2) INFORMATION FOR SEQ ID NO: 35

2010 (i) SEQUENCE CHARACTERISTICS:

2011 (A) LENGTH: 149 Amino Acids

2012 (B) TYPE: Amino Acid

2013 (C) STRANDEDNESS: single

2014 (D) TOPOLOGY: linear

2015 (ii) MOLECULE TYPE: Protein

2016 (iii) HYPOTHETICAL: no

2017 (iv) ANTI-SENSE: no

2018 (vi) ORIGINAL SOURCE:

2019 (A) ORGANISM: Tenebrio molitor

C--> 2020 (C) INDIVIDUAL ISOLATE: none

C--> 2021 (G) CELL TYPE: fat body and whole organism

2022 (vii) IMMEDIATE SOURCE:

2023 (A) LIBRARY: cDNA

2024 (B) CLONE: 3.9

C--> 2025 (ix) FEATURE:

2026 (D) OTHER INFORMATION: Mature Protein with His-tag

2027 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35:

2029 Met Gly Ser Ser His His His His His His Ser Ser Gly Leu Val Pro

2030 -30 -25 -20

2032 Arg Gly Ser His Met Ala Ser Met Thr Gly Gly Gln Gln Met Gly Arg

2033 -15 -10 -5

2035 Gly Ser Leu Thr Asp Glu Gln Ile Gln Lys Arg Asn Lys Ile Ser Lys

2036 1 5 10

2038 Glu Cys Gln Gln Glu Ser Gly Val Ser Gln Glu Thr Ile Asp Lys Val

2039 15 20 25 30

2041 Arg Thr Gly Val Leu Val Asp Asp Pro Lys Met Lys Lys His Val Leu

2042 35 40 45

2044 Cys Phe Ser Lys Arg Thr Gly Val Ala Thr Glu Ala Gly Asp Thr Asn

2045 50 55 60

E--> 2047 Val Glu Val Leu Lys Ala Lys Leu Lys HisVal Ala Ser Asp Glu Glu

E--> 2048 65 70 75

2050 Val Asp Lys Ile Val Gln Lys Cys Val Val Lys Lys Ala Thr Pro Glu

2051 80 85 90

2053 Glu Thr Ala Tyr Asp Thr Phe Lys Cys Ile Tyr Asp Ser Lys Pro Asp

E--> 2054 95 105 110

E--> 2056 Phe Ser Pro Ile Asp

E--> 2057 115

C--> 2139 (2) INFORMATION FOR SEQ ID NO: 37

2140 (i) SEQUENCE CHARACTERISTICS:

2141 (A) LENGTH: 173 Amino Acids

2142 (B) TYPE: Amino Acid

2143 (C) STRANDEDNESS: single

2144 (D) TOPOLOGY: linear

2145 (ii) MOLECULE TYPE: Protein

2146 (iii) HYPOTHETICAL: no

2147 (iv) ANTI-SENSE: no

2148 (vi) ORIGINAL SOURCE:

2149 (A) ORGANISM: Tenebrio molitor

C--> 2150 (C) INDIVIDUAL ISOLATE: none 

C--> 2151 (G) CELL TYPE: fat body and whole organism 

2152 (vii) IMMEDIATE SOURCE: 

2153 (A) LIBRARY: cDNA 

2154 (B) CLONE: 7.5 
C--> 2155 (ix) FEATURE: 

2156 (D) OTHER INFORMATION: Precursor Protein with His-tag 

2157 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37

2159 Met Gly Ser Ser His His His His His His Ser Ser Gly Leu Val Pro

2160 -55 -50 -45

2162 Arg Gly Ser His Met Ala Ser Met Thr Gly Gly Gln Gln Met Gly Arg

2163 -40 -35 -30

2165 Gly Ser Glu Phe Ala Arg Ala Lys Met Lys Leu Leu Leu Cys Phe Ala

2166 -25 -20 -15

2168 Phe Ala Ala Ile Val Ile Gly Ala Gln Ala Leu Thr Asp Glu Gln Ile

2169 -10 -5 1 5

2171 Gln Lys Arg Asn Lys Ile Ser Lys Glu Cys Gln Gln Val Ser Gly Val

2172 10 15 20

2174 Ser Gln Glu Thr Ile Asp Lys Val Arg Thr Gly Val Leu Val Asp Asp

2175 25 30 35

2177 Pro Lys Met Lys Lys His Val Leu Cys Phe Ser Lys Lys Thr Gly Val

2178 40 45 50

2180 Ala Thr Glu Ala Gly Asp Thr Asn Val Glu Val Leu Lys Ala Lys Leu

2181 55 60 65 70

2183 Lys His Val Ala Ser Asp Glu Glu Val Asp Lys Ile Val Gln Lys Cys

2184 75 80 85

2186 Val Val Lys Lys Ala Thr Pro Glu Glu Thr Ala Tyr Asp Thr Phe Lys

2187 90 95

E--> 2189 Cys Ile Tyr Asp Ser Lys Pro Asp Phe Ser Pro Ile Asp

E--> 2190 105 110 115

C--> 2260 (2) INFORMATION FOR SEQ ID NO: 39

2261 (i) SEQUENCE CHARACTERISTICS:

2262 (A) LENGTH: 149 Amino Acids

2263 (B) TYPE: Amino Acid

2264 (C) STRANDEDNESS: single

2265 (D) TOPOLOGY: linear

2266 (ii) MOLECULE TYPE: Protein

2267 (iii) HYPOTHETICAL: no

2268 (iv) ANTI-SENSE: no

2269 (vi) ORIGINAL SOURCE:

2270 (A) ORGANISM: Tenebrio molitor
C--> 2271 (C) INDIVIDUAL ISOLATE: none
C--> 2272 (G) CELL TYPE: fat body and whole organism

2273 (vii) IMMEDIATE SOURCE:

2274 (A) LIBRARY: cDNA

2275 (B) CLONE: 7.5
C--> 2276 (ix) FEATURE:

2277 (D) OTHER INFORMATION: Mature protein with His-tag

2278 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39

2280 Met Gly Ser Ser His His His His His His Ser Ser Gly Leu Val Pro

2281 -30 -25 -20

2283 Arg Gly Ser His Met Ala Ser Met Thr Gly Gly Gln Gln Met Gly Arg

2284 -15 -10 -5

2286 Gly Ser Leu Thr Asp Glu Gln Ile Gln Lys Arg Asn Lys Ile Ser Lys

2287 1 5 10

2289 Glu Cys Gln Gln Val Ser Gly Val Ser Gln Glu Thr Ile Asp Lys Val

2290 15 20 25 30

2292 Arg Thr Gly Val Leu Val Asp Asp Pro Lys Met Lys Lys His Val Leu

2293 35 40 45

2295 Cys Phe Ser Lys Lys Thr Gly Val Ala Thr Glu Ala Gly Asp Thr Asn

2296 50 55 60

2298 Val Glu Val Leu Lys Ala Lys Leu Lys His Val Ala Ser Asp Glu Glu

2299 65 70 75

2301 Val Asp Lys Ile Val Gln Lys Cys Val Val Lys Lys Ala Thr Pro Glu

2302 80 85 90

2304 Glu Thr Ala Tyr Asp Thr Phe Lys Cys Ile Tyr Asp Ser Lys Pro Asp

2305 95 105 110

E--> 2307 Phe Ser Pro Ile Asp

E--> 2308 115

C--> 2468 (2) INFORMATION FOR SEQ ID NO: 45

2469 (i) SEQUENCE CHARACTERISTICS:

2470 (A) LENGTH: 481 base pairs

2471 (B) TYPE: nucleic acid

2472 (C) STRANDEDNESS : double 

2473 (D) TOPOLOGY: linear 
2474 (ii) MOLECULE TYPE: cDNA to mRNA

2475 (iii) HYPOTHETICAL: no

2476 (iv) ANTI-SENSE: no
2477 (vi) ORIGINAL SOURCE:

2478 (A) ORGANISM: Tenebrio molitor

C--> 2479 (C) INDIVIDUAL ISOLATE: none

C--> 2480 (G) CELL TYPE: fat body and whole organism

2481 (vii) IMMEDIATE SOURCE:

2482 (A) LIBRARY: cDNA

2483 (B) CLONE: 2.2

C--> 2484 (ix) FEATURE:

2485 (D) OTHER INFORMATION: Concensus of Seq ID #44 with Tm 13.17

2486 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45:

2488 GGCANRNNNN AAR ATG AAR YTN CTC TNN TGY YTN RYN TYC NYC RYY 46

2489 Met Lys Leu Leu Leu Cys Phe Ala Phe Ala Ala

2490 -15 -10

2492 NTN NTN RTC RNA GYT CAG GCY CTN ACC GAN GNA CAR ATN NAG AAA 91

2493 Ile Val Ile Gly Ala Gln Ala Leu Thr Asp Glu Gln Ile Gln Lys

2494 -5 1 5

2496 NNG AAC AAG ATC AGC AAA RAR TGY CAR NAN GNR NNY GGA GTG TCN 136

2497 Arg Asn Lys Ile Ser Lys Glu Cys Gln Gln Val Ser Gly Val Ser

2498 10 15 20

2500 CAA GAG AYN ATN RNC AAA GYY CGC ANN GGT GNC TNG GNN GAY GAT 181

2501 Gln Glu Thr Ile Asp Lys Val Arg Thr Gly Val Leu Val Asp Asp

2502 25 30 35

2504 CCY AAA NTG AAR NRN CAN GTY YTY TGC NTN NCN ARG ARN RCY GGN 226
2505 Pro Lys Met Lys Lys His Val Leu Cys Phe Ser Lys Lys Thr Gly
2506 40 45 50

2508 NTG GCN ACN GAA NCN GGA GAN RYN RNN GTN GAN GTR YTN ARR GNN 271

2509 Val Ala Thr Glu Ala Gly Asp Thr Asn Val Glu Val Leu Lys Ala

2510 55 60 65

2512 AAG NTG ARG NAN GTN RCY RRC AAC GAC GAA GAR RYN GAN AAR ATC 316

2513 Lys Leu Lys His Val Ala Ser Asn Asp Glu Glu Val Asp Lys Ile

2514 70 75 80

2516 RTN NAN AAG TGC GYN GTC AAG ARR GNY ACN NYN GAR GAR ACG GYN 361

2517 Val Gln Lys Cys Val Val Lys Lys Ala Thr Pro Glu Glu Thr Ala

2518 85 90 95

2520 TNY RAY ACY TTC AAR NNT RTY NNN RAN ARY AAR CCN RAN TTC TCN 406

2521 Tyr Asp Thr Phe Lys Cys Ile Tyr Asp Ser Lys Pro Asp Phe Ser

2522 100 105 110

2524 CCN RTT GAT TRA NYNNYYNNNA YTNGNNNRNR NTTYRANAAT AAAGNNNNTN 458

2525 Pro Ile Asp *

2526 115
E--> 2528 TNRTNNNRNA AAAAAAAAAA AAAAAA
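
The consensus listings use IUPAC ambiguity codes (R, Y, N) alongside A, C, G and T. As a hedged illustration of how such a consensus can be compared with a concrete sequence, the sketch below expands each code into a regular-expression character class. Only the codes that appear in this listing are mapped, the helper names are assumptions made for the example, and the test codons are taken from listing line 2488 (consensus) and line 1343 (clone 13.17).

```python
import re

# IUPAC nucleotide codes that occur in the consensus listings (subset only).
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "[AG]",    # purine
    "Y": "[CT]",    # pyrimidine
    "N": "[ACGT]",  # any base
}


def consensus_to_regex(consensus: str) -> re.Pattern:
    """Compile a consensus string (spaces ignored) into a regular expression."""
    letters = consensus.replace(" ", "").upper()
    return re.compile("".join(IUPAC[ch] for ch in letters))


def matches(consensus: str, sequence: str) -> bool:
    """Return True if the sequence is one that the consensus allows."""
    seq = sequence.replace(" ", "").upper()
    return consensus_to_regex(consensus).fullmatch(seq) is not None


# Consensus codons from line 2488 against the clone 13.17 codons on line 1343.
print(matches("AAR ATG AAR YTN CTC", "AAG ATG AAG TTG CTC"))
```

A code outside the mapped subset raises a `KeyError`, which is a deliberate choice here so that unexpected symbols are noticed rather than silently accepted.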

C--> 2533 (2) INFORMATION FOR SEQ ID NO: 46

2534 (i) SEQUENCE CHARACTERISTICS:

2535 (A) LENGTH: 481 base pairs

2536 (B) TYPE: nucleic acid

2537 (C) STRANDEDNESS: double

2538 (D) TOPOLOGY: linear
2539 (ii) MOLECULE TYPE: cDNA to mRNA

2540 (iii) HYPOTHETICAL: no

2541 (iv) ANTI-SENSE: no

2542 Jr&T 
2543 (vi) ORIGINAL SOURCE:
2544 (A) ORGANISM: Tenebrio molitor

C--> 2545 (C) INDIVIDUAL ISOLATE: none

C--> 2546 (G) CELL TYPE: fat body and whole organism

2547 (vii) IMMEDIATE SOURCE:

2548 (A) LIBRARY: cDNA

2549 (B) CLONE: 2.2
C--> 2550 (ix) FEATURE:

2551 (D) OTHER INFORMATION: Concensus of Seq ID #45 with B1/B2

2552 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46:

2554 GGCANRNNNN AAR ATG AAR YTN CTC TNN TGY YTN RYN TYY NYC RYY 46
2555 Met Lys Leu Leu Leu Cys Phe Ala Phe Ala Ala

2556 -15 -10

2558 NTN NTN RTC NNA GYT CAG GCY NTN ACY NAN GNA NAN NTN NAG NNA 91

2559 Ile Val Ile Gly Ala Gln Ala Leu Thr Asp Glu Gln Ile Gln Lys

2560 -5 1 5

2562 NNG NNC NAR AYC AGC RNA RAR TGY NAR NNN GNR NNY GGA GTG TCN 136
2563 Arg Asn Lys Ile Ser Lys Glu Cys Gln Gln Val Ser Gly Val Ser

2564 10 15 20

2566 NAA GAN RYN ATN RNN ARA GYY CGC ANN GGT GNC TNG GNN GAY GAY 181

2567 Gln Glu Thr Ile Asp Lys Val Arg Thr Gly Val Leu Val Asp Asp

2568 25 30 35

2570 CCY AAA NTG AAR NNN CAN NTY YTY TGC NTN NYN ARG RNN NYY GRN 226

2571 Pro Lys Met Lys Lys His Val Leu Cys Phe Ser Lys Lys Thr Gly

2572 40 45 50

2574 NTR GYN RCN GAA NCN GGA GAN RYN RNN GYN GAN RYR YTN ARR GNN 271

2575 Val Ala Thr Glu Ala Gly Asp Thr Asn Val Glu Val Leu Lys Ala

2576 55 60 65

2578 AAG NTG ANG NRN NNN NNN RNN ANN RNN RAR RAR RYN RRN ARR NTN 316

2579 Lys Leu Lys His Val Ala Ser Asn Asp Glu Glu Val Asp Lys Ile

2580 70 75 80

2582 NYN NRN ARN NNN NNN NNN NNG ARN RNN NYN NNN RAR RNR NNN NNN 361

2583 Val Gln Lys Cys Val Val Lys Lys Ala Thr Pro Glu Glu Thr Ala

2584 85 90 95

2586 TNN RAN NYN YYN AAN NNN NNY NNN RRN ANN ARN CCN RNN TYY TYN 406

2587 Tyr Asp Thr Phe Lys Cys Ile Tyr Asp Ser Lys Pro Asp Phe Ser

2588 100 105 110

2590 CNN RYT RNT TRN NYNNNNNNNN YNNGNNNRNR NTTYRANAAT AAAGNNNYTN 458

2591 Pro Ile Asp *

2592 115

E--> 2594 TNRTNNNRNA AAAAAAAAAA AAAAAA 484

C--> 2599 (2) INFORMATION FOR SEQ ID NO: 47

2600 (i) SEQUENCE CHARACTERISTICS:

2601 (A) LENGTH: 481 base pairs

2602 (B) TYPE: nucleic acid

2603 (C) STRANDEDNESS : double 

2604 (D) TOPOLOGY: linear 

2605 (ii) MOLECULE TYPE: cDNA to mRNA 

2606 (iii) HYPOTHETICAL: no 

2607 (iv) ANTI-SENSE: no 

2608 (vi) ORIGINAL SOURCE: 

2609 (A) ORGANISM: Tenebrio molitor 
C--> 2610 (C) INDIVIDUAL ISOLATE: none 

C--> 2611 (G) CELL TYPE: fat body and whole organism 

2612 (vii) IMMEDIATE SOURCE: 

2613 (A) LIBRARY: cDNA 

2614 (B) CLONE: 2.2
C--> 2615 (ix) FEATURE: 

2616 (D) OTHER INFORMATION: Concensus of Seq. ID #46 with AFP-3 

2617 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47:

2620 GGCNNRNNNN AAR ATG AAR YTN CTC YNN TGY YTN RYN YYY NYY RYY 46

2621 Met Lys Leu Leu Leu Cys Phe Ala Phe Ala Ala

2622 -15 -10

2624 NTN NTN RYC NNR RYY YAN GCY NTN ACY NAN RNA NNN NNN NAG NNR 91

2625 Ile Val Ile Gly Ala Gln Ala Leu Thr Asp Glu Gln Ile Gln Lys

2626 -5 1 5






2628 NNG NNY NAR NNC AGC RNN RNN TGY NAR NNN GNR NNY GGA GTR TCN 136

2629 Arg Asn Lys Ile Ser Lys Glu Cys Gln Gln Glu Ser Gly Val Ser

2630 10 15 20

2632 NAA GAN NYN NTN RNN ARR GYY CGC ANN NGT GNN NNR GNN GAY GAY 181

2633 Gln Glu Thr Ile Asp Lys Val Arg Thr Gly Val Leu Val Asp Asp

2634 25 30 35

2636 CCY AAA NTG AAR NNN CAN NYY YTY TGC NTN NYN ARG RNN NYY GRN 226

2637 Pro Lys Met Lys Lys His Val Leu Cys Phe Ser Lys Lys Thr Gly

2638 40 45 50

2640 NTN RYN RNN GNN NNN GGN GAN NYN NNN NYN GAN NNN NTN ARR RNN 271

2641 Val Ala Thr Glu Ala Gly Asp Thr Asn Val Glu Val Leu Lys Ala

2642 55 60 65

2644 AAR NTN ANG NRN NNN NNN RNN RNN NNN RAR RAR RYN RRN RRN NTN 316

2645 Lys Leu Lys His Val Ala Ser Asn Asp Glu Glu Val Asp Lys Ile

2646 70 75 80

2648 NYN NNN ARN NNN NNN NNN NNG ARN RNN NYN NNN NAR NNN NNN NNN 361

2649 Val Gln Lys Cys Val Val Lys Lys Ala Thr Pro Glu Glu Thr Ala

2650 85 90 95

2652 NNN RAN NYN YYN AAN NNN NNY NNN RRN ANN ARN YCN NNN TNN NNN 406

2653 Tyr Asp Thr Phe Lys Cys Ile Tyr Asp Ser Lys Pro Asp Phe Ser

2654 100 105 110

2656 CNN NYN RNN TRN NNNNNNNNNN YNNRNNNNNN NNNNNNNAAT AAANNNNNNN 458

2657 Pro Ile Asp *

2658 115

E--> 2660 NNNNNNNNNA AAAAAAAAAA AAAAAA

C--> 2664 (2) INFORMATION FOR SEQ ID NO: 48

2665 (i) SEQUENCE CHARACTERISTICS:

2666 (A) LENGTH: 133 Amino Acids

2667 (B) TYPE: Amino Acid

2668 (C) STRANDEDNESS: single

2669 (D) TOPOLOGY: linear

2670 (ii) MOLECULE TYPE: Protein

2671 (iii) HYPOTHETICAL: no

2672 (iv) ANTI-SENSE: no

2673 (vi) ORIGINAL SOURCE:

2674 (A) ORGANISM: Tenebrio molitor

C--> 2675 (C) INDIVIDUAL ISOLATE: none

C--> 2676 (G) CELL TYPE: fat body and whole organism

2677 (vii) IMMEDIATE SOURCE:

2678 (A) LIBRARY: cDNA

2679 (B) CLONE:

C--> 2680 (ix) FEATURE:

2681 (D) OTHER INFORMATION: General Concensus of Clones

2682 B1, B2 and AFP-3

2684 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 48:

2686 Met Lys Leu Leu Leu Cys Phe Ala Phe Ala Ala Ile Val Ile Gly

E--> 2687 Cys Leu Ile Ser Leu Ile Leu Leu Val Thr

E--> 2688 Thr Leu Val Ala Ala

E--> 2689 Val

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E- 


-> 


2690 








-15 










-10 








-5 










2692 


Gin 


Ala 


Leu 


Thr 


Asp 


Glu 


Gin. 


He 


Gin 


Lys 


Arg 


Asn 


Lys He 


Ser 


Lys 


E- 


-> 


2693 


Tyr 




He 




Glu 


Ala 


Asp 


Leu 


Glu 


Leu 


Leu 


Arg 


Gin Thr 




Ala 


E- 


-> 


2694 












Thr 


Pro 


Arg 








Lys 


His 




Asp 


E- 


-> 


2695 






1 










5 










10 










2697 


Glu 


Cys 


Gin 


Gin 


Glu 


Ser 


Gly 


Val 


Ser 


Gin 


Glu 


Thr 


He Asp 


Lys 


Val 


E- 


-> 


2698 


Lys 




Lys 


Asn 


Val 






Ala 




Glu 


Asp 


He 


Leu Thr 


Arg 


Ala 


E- 


-> 


2699 


Ala 






Thr 














Ala 


Val 


Lys 






E- 


-> 


2700 








Ala 
















Ser 


Asn 






E- 


-> 


2701 


15 










20 










25 








30 






2703 


Arg 


Thr 


Gly 


Val 


Leu 


Val 


Asp 


Asp 


Pro 


Lys 


Met 


Lys 


Lys His 


Val 


Leu 


E- 


-> 


2704 




Asn 


Arg 


Asp 


Trp 


Glu 










Leu 




Arg Gin 


Leu 


Phe 


E- 


-> 


2705 




Lys 




Glu 


Glu 
















Met 


Ala 




E- 


-> 


2706 


























Glu 






E- 


-> 


2707 










35 










40 








45 








2709 


Cys 


Phe 


Ser 


Lys 


Lys 


Thr 


Gly Val 


Ala 


Thr 


Glu Ala Gly Asp 


Thr 


Asn 


E- 


-> 


2710 




Val 


Ala 


Arg 


Arg 


Ala 


He 


Leu 


Val 


Ala 


Ala 


Ser 


Glu 


He 


Glu 


E- 


-> 


2711 




He 


Phe 




Ala 


Leu 


Glu 


He 


He 


Asp 








Val 


Val 


E- 


-> 


2712 






Leu 




Asn 






Glu 












Phe 


Gin 


E- 


-> 


2713 
















Phe 
















E- 


-> 


2714 








50 










55 








60 






E- 


-> 


2716 


Val 


Glu 


Val 


Leu 


Lys 


Ala 


Lys 


Leu 


Lys 


His 


Val 


Ala 


Ser AAA 


Asp 


Glu 


E- 


-> 


2717 


Ala 


Asp 


Thr 


Phe 


Arg 


Glu 




Val 


Thr 


Arg 


Asn Thr Asn Asp 


Pro 




E- 


-> 


2718 


Leu 




His 


He 




Thr 




Phe 


Arg 


Lys 




Ser Asp Asn 






E- 


-> 


2719 




















Glu 






Glu His 






E- 


-> 


2720 






65 










70 










75 










2722 


Glu 


Val 


Asp 


Lys 


He 


Val 


Gin 


Lys 


Cys 


Val 


Val 


Lys 


Lys Ala 


Thr 


Pro 


E- 


-> 


2723 


Lys 


Ser 


Glu 


Asp 


Leu 


He 


Glu 






Ala 




Thr 


Glu Asp 




Val 


E- 


-> 


2724 




Thr 










Asn 






Thr 






Arg 






E- 


-> 


2725 














Ala 


















E- 


-> 


2726 




80 










85 










90 












2728 


Glu 


Glu 


Thr 


Ala 


Tyr 


Asp 


Thr 


Phe 


Lys 


Cys 


Ile


Tyr Asp Ser 


Lys 


Pro 


E- 


-> 


2729 


Gln Asp


Ser 


Val 


Phe 


Glu 


Val 


Thr 




Val 


Val 


Leu 


Lys Asn Arg 


Ser 


E- 


-> 


2730 




His 




Ser 


Ala 


Asn 


Phe 










Met 


Asp 






E- 


-> 


2731 
























His 








E- 


-> 


2732 


95 










100 










105 








110 


E- 


-> 


2734 


Asp 


Phe 


Ser 


Pro 


Ile


Asp 


AAA 
















E- 


-> 


2735 


Asn 




Phe 


Gly 


Asp 


Leu 


Phe 


Val 




w 


> 


) 








E- 


-> 


2736 


Lys 








Val 






















E- 


-> 


2737 










115 






















E- 


-> 


2742


f 166 




























E- 


-> 


2746


L:
L : 
L: 
L: 
L: 
L: 
L: 
L: 
L: 
L: 
L: 
L: 
L: 
L: 
L: 
L: 
L: 
M: 
L: 
L: 
L : 
L: 
L: 
L: 
L: 
L: 
L: 
L: 
L: 
L: 
L: 
L: 
L: 
L: 
L: 
L: 
L: 
L: 
L: 
L: 
L: 
L: 
L: 
L: 
L: 
L: 
L: 
L: 



3 M:220 C: 
8 M:220 C: 

13 M:220 C 

14 M:220 

20 M:220 

21 M:220 
23 M:220 
26 M:220 
30 M:220 
35 M:220 
47 M:220 

M:220 
M:220 
M:220 
M:220 
M:220 
M:254 
Repeated 
M:204 E: 
M:220 
M:220 
M:220 
151 M:332 
171 M:342 
M:203 
M:220 
M:220 
M:220 
M:342 
217 M:203 
220 M:220 
M:220 



48 

52 

60 

71 

72 

80 

254 

124 

127 

138 

139 



172 
175 
186 
187 
216 



C: 
C: 
C: 
E: 
E: 
E: 



231 
232 
282 
293 
294 
344 



220 
220 
220 
220 
220 



No. of Seq. differs, LENGTH : Input : 116 Found: 117 SEQ:4 



355 M:220 

356 M:220 C: 

388 M:342 E: 

389 M:203 
392 M:220 

M:220 
M:220 
M:342 
M:203 
M:220 
M:220 



Keyword misspelled or invalid format,
Keyword misspelled or invalid format, 
Keyword misspelled or invalid format, 
Keyword misspelled or invalid format, 
Keyword misspelled or invalid format, 
Keyword misspelled or invalid format, 
Keyword misspelled or invalid format, 
Keyword misspelled or invalid format, 
Keyword misspelled or invalid format, 
Invalid Stop Code On Error, STOP CODON 



403 
404 
433 
434 
437 
448 



E: 
C: 
C: 



No. of Seq. differs, LENGTH: Input : 133 Found: 134 SEQ: 7 



(2) INFORMATION FOR SEQ ID 
(C) INDIVIDUAL ISOLATE : ] 
(G) CELL TYPE:] 



Keyword misspelled or invalid format, 
Keyword misspelled or invalid format, 
C: Keyword misspelled or invalid format, 
E: Invalid Stop Code On Error, STOP CODON 
E: No. of Seq. differs, LENGTH: Input : 115 Found: 116 SEQ: 8 
C: Keyword misspelled or invalid format, 
C: Keyword misspelled or invalid format, 



F) 
v) 
A) 
B) 



Keyword misspelled or invalid format, [ 
Keyword misspelled or invalid format, [ 
Keyword misspelled or invalid format, 
Keyword misspelled or invalid format, 
Keyword misspelled or invalid format, 
Keyword misspelled or invalid format, 
Keyword misspelled or invalid format, 
Keyword misspelled or invalid format, 
Keyword misspelled or invalid format, 
Keyword misspelled or invalid format, 
Keyword misspelled or invalid format, 
Keyword misspelled or invalid format, 
Keyword misspelled or invalid format, 
Keyword misspelled or invalid format, 
Keyword misspelled or invalid format, 
Keyword misspelled or invalid format, 
No. of Bases conflict, Input: 36 Counted 
in SeqNo=2 

No. of Bases differ, LENGTH : Input : 566 Counted: 576 SEQ: 2 
Keyword misspelled or invalid format, 
Keyword misspelled or invalid format, 
Keyword misspelled or invalid format, 
(32) Invalid/Missing Amino Acid Numbering, 
Invalid Stop Code On Error, STOP CODON 
No. of Seq. differs, LENGTH: Input : 134 Found:135 SEQ:3 
C: Keyword misspelled or invalid format, 
C: Keyword misspelled or invalid format, 
Keyword misspelled or invalid format, 
Invalid Stop Code On Error, STOP CODON 



1) GENERAL INFORMATION: ] 
iv) CORRESPONDENCE ADDRESS : ] 
ZIP: ] 

COMPUTER READABLE FORM:] 
APPLICATION NUMBER : ] 
FILING DATE : ] 

vii) PRIOR APPLICATION DATA:] 

viii) ATTORNEY/AGENT INFORMATION : ] 

ix) TELECOMMUNICATION INFORMATION : ] 
2) INFORMATION FOR SEQ ID NO:] 
C) INDIVIDUAL ISOLATE : ] 
G) CELL TYPE:] 
ix) FEATURE : ] 

2) INFORMATION FOR SEQ ID NO:] 
INDIVIDUAL ISOLATE:] 
CELL TYPE : ] 
SEQ: 2 



C) 
G) 
46 



(2) 
(C) 
(G) 



(2) 
(C) 
(G) 
* 



INFORMATION FOR SEQ ID 
INDIVIDUAL ISOLATE : ] 
CELL TYPE:] 
SEQ ID: 3 



INFORMATION FOR SEQ ID 
INDIVIDUAL ISOLATE:] 
CELL TYPE: ] 



NO: ] 



NO: ] 



(2) INFORMATION FOR SEQ ID NO:] 

(C) INDIVIDUAL ISOLATE : ]

(G) CELL TYPE: ] 

(2) INFORMATION FOR SEQ ID NO:] 

(C) INDIVIDUAL ISOLATE : ] 

(G) CELL TYPE: ]

(2) INFORMATION FOR SEQ ID NO:] 

(C) INDIVIDUAL ISOLATE : ] 

(G) CELL TYPE:] 
* 



(2) 
(C) 



INFORMATION FOR SEQ ID 
INDIVIDUAL ISOLATE : ] 



NO: ] 



NO: ]
PATENT APPLICATION: US/09/876,348
DATE: 06/27/2001
TIME: 15:25:41

Input Set : A:\SEQLIST.txt
Output Set: N:\CRF3\06272001\I876348.raw

L:449 M:220 C: Keyword misspelled or invalid format, [(G) CELL TYPE:]
L:500 M:220 C: Keyword misspelled or invalid format, [(2) INFORMATION FOR SEQ ID NO:]
L:511 M:220 C: Keyword misspelled or invalid format, [(C) INDIVIDUAL ISOLATE:]
L:512 M:220 C: Keyword misspelled or invalid format, [(G) CELL TYPE:]
L:544 M:342 E: Invalid Stop Code On Error, STOP CODON:*
L:545 M:203 E: No. of Seq. differs, LENGTH: Input:133 Found:134 SEQ:10
L:548 M:220 C: Keyword misspelled or invalid format, [(2) INFORMATION FOR SEQ ID NO:]
L:559 M:220 C: Keyword misspelled or invalid format, [(C) INDIVIDUAL ISOLATE:]
L:560 M:220 C: Keyword misspelled or invalid format, [(G) CELL TYPE:]
L:589 M:342 E: Invalid Stop Code On Error, STOP CODON:*
L:590 M:203 E: No. of Seq. differs, LENGTH: Input:115 Found:116 SEQ:11
L:593 M:220 C: Keyword misspelled or invalid format, [(2) INFORMATION FOR SEQ ID NO:]
L:604 M:220 C: Keyword misspelled or invalid format, [(C) INDIVIDUAL ISOLATE:]
L:605 M:220 C: Keyword misspelled or invalid format, [(G) CELL TYPE:]
L:656 M:220 C: Keyword misspelled or invalid format, [(2) INFORMATION FOR SEQ ID NO:]
L:667 M:220 C: Keyword misspelled or invalid format, [(C) INDIVIDUAL ISOLATE:]
L:668 M:220 C: Keyword misspelled or invalid format, [(G) CELL TYPE:]
L:691 M:333 E: Wrong sequence grouping, Amino acids not in groups!
L:692 M:332 E: (32) Invalid/Missing Amino Acid Numbering, SEQ ID:13
M:332 Repeated in SeqNo=13
L:700 M:342 E: Invalid Stop Code On Error, STOP CODON:*
L:704 M:220 C: Keyword misspelled or invalid format, [(2) INFORMATION FOR SEQ ID NO:]
L:736 M:333 E: Wrong sequence grouping, Amino acids not in groups!
L:737 M:332 E: (32) Invalid/Missing Amino Acid Numbering, SEQ ID:14
M:332 Repeated in SeqNo=14
L:745 M:342 E: Invalid Stop Code On Error, STOP CODON:*
L:773 M:254 E: No. of Bases conflict, Input:90 Counted:91 SEQ:15
L:938 M:342 E: Invalid Stop Code On Error, STOP CODON:*
L:939 M:203 E: No. of Seq. differs, LENGTH: Input:169 Found:174 SEQ:17
L:1056 M:342 E: Invalid Stop Code On Error, STOP CODON:*
L:1057 M:203 E: No. of Seq. differs, LENGTH: Input:149 Found:150 SEQ:19
L:1186 M:342 E: Invalid Stop Code On Error, STOP CODON:*
L:1187 M:203 E: No. of Seq. differs, LENGTH: Input:169 Found:174 SEQ:21
L:1304 M:342 E: Invalid Stop Code On Error, STOP CODON:*
L:1305 M:203 E: No. of Seq. differs, LENGTH: Input:149 Found:150 SEQ:23
L:1376 M:254 E: No. of Bases conflict, Input:595 Counted:591 SEQ:24
M:254 Repeated in SeqNo=24
L:1388 M:204 E: No. of Bases differ, LENGTH: Input:777 Counted:776 SEQ:24
L:1421 M:332 E: (32) Invalid/Missing Amino Acid Numbering, SEQ ID:25
M:342 Repeated in SeqNo=25
L:1442 M:203 E: No. of Seq. differs, LENGTH: Input:170 Found:175 SEQ:25
L:1559 M:342 E: Invalid Stop Code On Error, STOP CODON:*
L:1560 M:203 E: No. of Seq. differs, LENGTH: Input:149 Found:150 SEQ:27
L:1689 M:342 E: Invalid Stop Code On Error, STOP CODON:*
L:1690 M:203 E: No. of Seq. differs, LENGTH: Input:173 Found:174 SEQ:29
L:1750 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID:30
L:1754 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID:30
L:1758 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID:30
L:1808 M:342 E: Invalid Stop Code On Error, STOP CODON:*




file://C:\CRF3\Outhold\VsrI876348.htm 



6/27/01 



VERIFICATION SUMMARY DATE: 06/27/2001 

PATENT APPLICATION: US/09/876,348 TIME: 15:25:41

Input Set : A:\SEQLIST.txt 

Output Set: N:\CRF3\06272001\I876348.raw 

L:1809 M:203 E: No. of Seq. differs, LENGTH: Input:149 Found:150 SEQ:31
L:1932 M:333 E: Wrong sequence grouping, Amino acids not in groups!
L:1933 M:332 E: (32) Invalid/Missing Amino Acid Numbering, SEQ ID:33
M:332 Repeated in SeqNo=33
L:1938 M:342 E: Invalid Stop Code On Error, STOP CODON:*
L:2047 M:333 E: Wrong sequence grouping, Amino acids not in groups!
L:2048 M:332 E: (32) Invalid/Missing Amino Acid Numbering, SEQ ID:35
M:332 Repeated in SeqNo=35
L:2056 M:342 E: Invalid Stop Code On Error, STOP CODON:*
L:2101 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID:36
L:2105 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID:36
L:2110 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID:36
L:2114 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID:36
L:2118 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID:36
L:2122 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID:36
L:2126 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID:36
L:2130 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID:36
L:2189 M:342 E: Invalid Stop Code On Error, STOP CODON:*
L:2190 M:203 E: No. of Seq. differs, LENGTH: Input:173 Found:174 SEQ:37
L:2307 M:342 E: Invalid Stop Code On Error, STOP CODON:*
L:2308 M:203 E: No. of Seq. differs, LENGTH: Input:149 Found:150 SEQ:39
L:2528 M:204 E: No. of Bases differ, LENGTH: Input:481 Counted:484 SEQ:45
L:2594 M:204 E: No. of Bases differ, LENGTH: Input:481 Counted:484 SEQ:46
L:2660 M:204 E: No. of Bases differ, LENGTH: Input:481 Counted:484 SEQ:47
L:2687 M:332 E: (32) Invalid/Missing Amino Acid Numbering, SEQ ID:0
M:332 Repeated in SeqNo=48
L:2716 M:330 E: (2) Invalid Amino Acid Designator, 1
L:2734 M:342 E: Invalid Stop Code On Error, STOP CODON:*
L:2734 M:330 E: (2) Invalid Amino Acid Designator, 2
M:342 Repeated in SeqNo=48
L:2746 M:333 E: Wrong sequence grouping, Amino acids not in groups!
L:2746 M:330 E: (2) Invalid Amino Acid Designator, 2
L:2746 M:203 E: No. of Seq. differs, LENGTH: Input:133 Found:320 SEQ:48






